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NATIONAL ADVISORY COMMITTEE FOR AERONAUTICS 
TECHNICAL MEMORANDUM 1404 

INVESTIGATION CF APERIODIC TIME PROCESSES WITH AUTOCORRELATION 

AND FOURIER ANALYSIS * 

By Marie Luise Exner 


SUMMARY 

Autocorrelation and frequency analyses of a series of aperiodic t ime 
events, in particular, filtered noises and sibilant sounds, were made. 

The position and band width of the frequency ranges are best obtained 
from the frequency analysis, but the energies contained in the several 
bands are most easily obtained from the autocorrelation function. 

The mean number of zero crossings of the time function was deter- 
mined from the curvature of the latter function in the vicinity of the 
zero crossing, and also with the aid of a decimal counter. The second 
method was found to be more exact. 


1. INTRODUCTION 


For the analysis of voice sounds, practically the only method ap- 
plied up to this time has been the Fourier analysis of the time process 
according to frequency. For vowels and other noises with predominantly 
periodic components, the analysis of the sound is undoubtedly the most 
suitable for the problem. The voiceless consonants are, however, only 
slightly periodic, and it is to be expected that the autocorrelation 
analysis will give a more suitable description. Attempts of this kind 
of analysis have already been made by Stevens (ref. l) . 

The autocorrelation function § (t) of a time process f(t) is 
defined by 

r+T 

§(t) = lim / f(t) f(t±T) dt (1) 


^''Untersuchung unperiodischer Zeitvor gauge mit der Autokorrelations- 
und der Fourieranalyse . " Acustica, Bd. 4, Nr. 3, 1954, pp. 365-379. 
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where f ( t±T) is the time function for an earlier and later time instant 
t, respectively. The autocorrelation function is, thus, a simplified 
form of the time function in which all phase relations are neglected, 
and is related to_t he original time function in the same way that the 
power spectrum |a(v}/2 is related to the spectral function X(v) (in 
magnitude and phase) . 

According to Wiener, who introduced the autocorrelation function in 
information theory, 1 the theorem, named for him, holds that the power 
spectrum and the autocorrelation function of a time function form a pair 
of Fourier transforms. Fundamentally, it is thus not to be expected that 
the autocorrelation analysis should yield more than the Fourier analysis. 
However, in any individual case, the investigation of a time function 
according to either method of analysis may be more advantageous. 

The autocorrelation analysis offers an advantage, for example, in 
the analysis of statistical processes for which only the probability 
distribution is known (ref. 3). These processes are not directly acces- 
sible to the theoretical treatment with the Fourier analysis, whereas, 
with the aid of the ergodic hypothesis (equality of the ensemble and time 
average of a statistical process) , the autocorrelation function can be 
directly computed from the probability distribution. From this function 
and through a Fourier transformation, the frequency spectrum of the sta- 
tistical process is obtained. This property of the autocorrelation func- 
tion as a mediator between the probability distribution and the frequency 
spectrum of a statistical noise is not applied, however, in the present 
work. 


Since the autocorrelation analysis of time functions represents a 
somewhat unfamiliar mode of treatment, it appears suitable first to meas- 
ure the autocorrelation functions of simple noises and compare them with 
the measured Fourier spectra before we begin to analyze complex noises 
(e.g., sibilant consonants). The present investigations are correspond- 
ingly divided into the following four groups: 

(A) Investigations on filtered noises 

(B) Investigations on frequency modulated impulses 

(C) Investigations on a noise on which a periodic component is 
superposed 

(D) Investigation of voiceless and voiced sibilant and fricative 
consonants 


■'■A review of the properties and applications of the autocorrelation 
function is given In a work by Lee and Wiesner (ref. 2). 
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Several properties of the autocorrelation function may he stated in 
advance : 


(a) The autocorrelation function is a symmetrical function. Its 
maximum value lies at T = 0, and is equal to the time-averaged square of 
the time function, that is, 

§ (0) = f 2 (t) (2) 

(b) The autocorrelation function of a sine or cosine function is a 
cosine function of equal period. The autocorrelation function of a sta- 
tistical noise is an exponential function which drops more rapidly as the 
noise frequencies become higher. In general, it can be stated that the 
wider the frequency band of a time function the more rapidly its autocor- 
relation function drops with increasing t. 

(c) If the noise contains various periodic and aperiodic components, 
each one begins at t = 0 with its maximum value and then, depending on 
its frequency -band width, drops more or less rapidly. Independently of 
the other components. In this way periodic components, for example, can 
be sif ted out of an aperiodic noise. For t = 0, the energy contribution 
f 2(t) of each component can be observed. 

(d) The curvature of the autocorrelation function at the zero cross- 
ing is connected with the mean number of the zero crossings per sec- 

ond of the initial function: 

p 0 = k o ^ #(o)^ > k o " * ^ 

(see refs. 4 and 5). 

(e) For the mean number of maxima and minima per second of the time 
function the following expression applies: 


p m 


* 




-«F(o) 



( 4 ) 


2. COMPUTATION OF THE AUTOCORRELATION FUNCTION OF A FIITERED NOISE 


The autocorrelation function of the filtered noise can be computed 
by two different methods (refs. 3 and 5). The first method makes use of 
the definition of the autocorrelation function (eq. (l)), and the second 
method makes use of Wiener’s theorem. 
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(a) The time function, which is to "be correlated with itself, is 
the received function of the filter if the filter is excited by a sta- 
tistical noise with constant spectral amplitude (white noise) . This 
noise can also he regarded as a statistical, infinitely dense, sequence 
of S impulses. When excitation of the filter by a single 5 impulse 
occurs, the received function is equal to the so-called weighting function 
W(t) of the filter, which is connected with its transfer function Y(v) 
through a Fourier transformation 

W(t) = / Y(v)e J2,tvt dv (5) 

00 


The autocorrelation function of the output voltage is, in this case, 
according to equation (l) : 


(t) 



( 6 ) 


It is shown (ref. 3) that equation (6) holds true not only for the exci- 
tation of the filter by a single 6 impulse, but also for the excitation 
by a statistical sequence of 5 impulses (i.e., for noises). The auto- 
correlation function of the filtered noise can thus be computed if the 
transfer function of the filter is known. 


(b) The Wiener theorem represents a relation between the autocorre- 
lation function and the spectral density of a time function. 

Let the time function be fqi(t) in the region -T<t<+T and zero 
outside this region. Then its Fourier transform, the spectral function 
is 


Aij(v ) 


dt = jQ T at 


( 7 ) 


The spectral density is defined as 

G(v) = lira ilA^v)! 2 __ (8) 

-+• oo 


Wiener's theorem states that 


$ 



G(v) cos 2:tvTdv 


( 9 ) 
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that is, the autocorrelation function and the spectral density are con- 
nected hy a Fourier transformation. 

The spectral density of the output of a filter, whose input is a 
white statistical noise of spectral density N, can "be computed from the 
transfer function 

G(v) = nJy(v) I 2 (10) 

For the computation of the autocorrelation function of filtered 
noises the following scheme is thus obtained: 

| Y(v) 1 

(1) W(t) G(v) (2) 

I §( T ) 1 

In the first case, we go from the frequency into the time plane 
through Fourier transformation, and all phase relations are then elimi- 
nated; in the second case, the phases axe first dispensed with in the 
frequency range, and than a Fourier transformation is carried out. 

In the present work it is suitable to apply the first method for the 
RC and LC filters, and the second method for the rectangular filters. 


Examples 


1. RC filter 

The transfer function of the RC filter is read from the circuit 
diagram (fig. 6(a)) as: 


Y(v) 


l/,12rtvC l/RC 

R + 1/J2*VC j2itv + l/RC 


(H) 


The weighting function is then given by the Fourier transformation as 


W(t) = ^ e-V 110 (12) 

According to equation (6), there is obtained for the autocorrelation 
function 


$(t0 


lim 

T -* co 


— 1 _ 
2T (iH)^ 



h l/RC 


dt 


(13) 



6 


NACA TM 1404 


or 


#(t) = $(0)e"^^/ RC with. - $ (0) = W^(t) 
Similarly, there is obtained for the LC filter (fig. 7(a)), 


Y(V) = 


JL/UL 


(J2rtv + a) 2 + P‘ 


#(t) = #(0)e -a ^ T lcos p(t| 


(14) 


(15) 


(16) 


where 6 is the natural frequency of the damped circuit and in a all 
the losses of the filter are included. The autocorrelation function is 
thus an exponential damped cosine function. 

2. Rectangular filter 

Transfer function: 

Y(v) = A for v Q - Av/2<v <v 0 + Av/2 
Y(v) =0 elsewhere 


(17) 


The spectral density of the filtered noise, according to equation (10) 
is 


and 


G(v) = NA 2 in the transmission region 


G(v) =0 elsewhere 


(18) 


According to the Wiener theorem (eq. (9)), there is obtained for the 
autocorrelation function 


rv 0 +Av/2 

<i>('c) = NA 2 / , cos 2itvT dv 

j v q -Av/2 


(19) 


= 


HA" 


cos 2 «VqT 


sin tiAvt 


( 20 ) 


The autocorrelation function of the rectangular filter is a cosine func- 
tion whose amplitude does not decrease exponentially but decreases like 
a slit function. 
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For the rectangularly hounded low pass filter with the limiting fre- 
quency there holds, with Av = 2 Vq = Vg^, 

•(T) > (2!) 


3. APPARATUS 


(A) Autocorrelator 

An apparatus was developed which made it possible, from a given time 
function f(t), to determine the autocorrelation function according to 
equation (l) . The block diagram of the apparatus is shown in figure 1. 

The values of the time function were recorded on an endless magnetic 
tape, about 1 meter in length, which could receive speech up to 95 percent 
of its length. The interval between recordings was determined by the 
distance of the erase head from the record head. The frequency -response 
curve of the magnetic tape is seen in figure 2, where the ordinate has a 
linear scale . This record was obtained by recording a very broad banded 
noise on the magnetic tape, and analysing the played back noise with a 
wide-range spectrum analyzer (ref. 7). The response is uniform up to 
about 16 kilocycles per second, and then drops rapidly. In measurements 
with pure tones and for the noise investigations, the time function was 
directly obtained without using a storage from the vibrating buzzer or 
the noise generator. 

A properly terminated lumped-constant transmission line was employed 
as a delay mechanism. The characteristics of the transmission line were: 

C = 48,900 micromicrofarads and L = 5.99 millihenries, from which a 
characteristic impedance of Z = 350 ohms and a time delay per section 
= 17.2 microseconds, was obtained. The measured limiting frequency 
was 25 kilocycles. The damping of the transmission line for 50 sections, 
giving a time delay of about 850 microseconds, was 1.2 decibels at 2000 
cycles per second, and up to 1.5 decibels for higher frequencies. In 
order to equalize the frequency dependence produced by passing the signal 
through the delay line, the undelayed voltage was conducted through sev- 
eral sections of delay line having an equal impedance; in addition, 
negative delays could then also be realized. At a maximum, the time 
function could be delayed by 2.5 milliseconds and by -0.014 millisecond. 

Multiplication . - Most of the multiplication procedures that have 
become familiar are eliminated when the following requirements are set: 
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(1) The product should he zero if a factor becomes zero (e.g., not 
satisfied in multiplicative mixture with a hexode). 

(2) A true four-quadrant multiplication should be carried out, that 
is, the sign of the two factors should be taken into account (not sat- 
isfied in the case of all logarithmic and impulse modulation processes) . 

(3) The multiplier should not simultaneously integrate with a fixed 
time constant (not satisfied, e.g., with dynamometers, electrometers, 
and thermoconverters) . 

(4) The frequency response of the multiplier must be from about 50 
cycles per second to 15 kilocycles per second. 

(5) Independence of the multiplication on the input voltages of the 
multiplier (not satisfied when operating in the quadratic region of tube 
characteristic curves) . 

All the preceding requirements are satisfied, in a very simple man- 
ner, by a ring modulator whose four rectifier elements have very accurate 
equal characteristic curves. Elements corresponding to each other were 
selected from a large number of rectifiers. 2 The curve in figure 3, 
which simultaneously represents the characteristic of the entire auto- 
correlator, shows that the product voltage $(0) is within a wide range 
proportional to the product of the two (in this case equal) input voltages 
f 2 (t) . Using this measurement, the modulation of the multiplier is 30 
millivolts. For larger modulation, the product <f> (0) indicated is too 
small. Small asymmetries of the rectifier can largely be equalized by 
reversing the poles of one of the two voltages on the multiplier, and 
then taking their average. The integration was carried out with an RC 
element whose time constant could be readily varied. Generally, a time 
constant RC = 50 milliseconds (R = 5k2, C = lOpF) was used. In these 
measurements, therefore, we are dealing with an autocorrelation with 
limited integrating time (i.e., a short-time autocorrelation) (ref. 6). 

Indicator mechanism . - The very small direct voltage delivered by 
the integrator is proportional to the autocorrelation function, and can 
be directly measured with a very sensitive galvanometer. In order to 
circumvent the inconvenient galvanometer measurement and make it possible 
to register the measurement results automatically, the direct voltage was 
broken up with a relay to the beat of the net power supply frequency. 

From the resulting rectangular voltages the fundamental frequency was 
filtered out with an octave band width filter. After corresponding am- 
plification, this alternating voltage was conducted to a cathode -ray 
oscillograph on whose screen, and on synchronization of the time deflec- 
tion with the supply frequency, it was possible to read the autocorrela- 
tion function in magnitude and sign. Because of the greater reading 


2 


The multiplier was set up by S. Vogel. 
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accuracy, however, the magnitude of $ (t) was generally read on an indi- 
cator instrument connected with the measuring amplifier. 


Testing of Autocorrelator 


1. Frequency Dependence 

The frequency dependence of the entire autocorrelation apparatus 
(without the storage magnetic tape) is shown in figure 4. The response 
is flat up to about 10 kilocycles per second. The waviness at the higher 
frequencies is due to the delay circuits, whose wave resistance is no 
longer constant in the neighborhood of the limiting frequency. 

2. Autocorrelation of a Sine Oscillation 

The continuous curve in figure 5 shows the computed autocorrelation 
function of a sine oscillation of 1940 cycles per second; the points shown 
are the measured values. The agreement for small t is very good; for 
large t the damping of the delay circuit becomes appreciable, for which 
however, a correction can be made if the measured values are plotted on 
a logarithmic scale. 

3. Autocorrelation of a Periodically Interrupted Time Function 

Since for all speech investigations the time function was periodi- 
cally interrupted because of the gaps on the magnetic tape, it was ascer- 
tained, with the aid of a sine function impressed on the tape, that the 
apparatus also operated reliably in this . case . 


(B) SOUND ANALYZER 

For the sound analysis, the wide-range spectrum analyzer of Tamm and 
Pritsching (ref. 7) was applied. It contained a mechanical filter of 15 
cycles per second band width and a very sharp cutoff. The analysis time 
amounted to 150 seconds for the frequency range of 0 to 20 kilocylces per 
second. 


(G) DECIMAL COUNTER 

To determine the zero crossings of the time functions, an electronic 
decimal counter was employed. 3 In this apparatus, for each zero crossing 
of the input voltage, an impulse is released by a trigger and is counted 


^Developed in the III. Physical Institute of the University of 
Gottingen by H. Henze. 
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in the usual manner by the multivibrator principle. The apparatus can 
count about 2x10® zero crossings per second in a uniform series of cross- 
ings and, therefore, possesses a time -resolving power of 0.5*10"® second. 
This resolving power is also sufficient to take care of the irregular 
sequence of zero crossings of a statistical noise exactly, providing a 
frequency range of 20 kilocycles per second is not essentially exceeded. 

In statistical processes, the measured number of zero crossings is 
somewhat dependent on the input voltage of the counter. If the maximum 
between two neighboring crossings Is so small that the trigger does not 
respond, only one impulse is released. On the other hand, for very large 
input voltages, the counter can he blocked for a short time so that too 
few zero crossings are indicated. For these reasons, in the case of each 
noise, the dependence of the number of crossings on the input voltage of 
the counter was measured, and the measurements were considered reliable 
only if the indication remained constant over a large range of Input 
voltage . 


4. MEASUREMENT RESULTS 

(a) filtered noise 

As a noise source, a wide band amplifier (Rohde and Schwarz), which 
supplied a uniform noise extending far above 20 kilocycles per second, 
was employed. 

Two groups of filters are distinguished: (l) the usual RC and LC 

filters and (2) the so-called "Ideal filter" whose pass band is limited 
(e.g., octave, one-third octave, and low-pass filters). In the case of 
the rectangular filters, the envelope of the autocorrelation function has 
the form of a slit function sin aT/ax, whereas In the case of the LC 
filters, it is an exponential function. 

For each noise the following measurements were carried out: 

(a) Measurement of the frequency response of. the filter with pure 
tones 

(p) Spectral analysis of the filtered noise with the wide-range 
spectrum analyzer 

(r) Autocorrelation analysis of the filtered noise 

(5) Determination of the average number of zero crossings per 
second of the time function with the decimal counter 
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1. Noises over RC Filter 
From figure 6(a), 

R =■ 16 k2 
C = 4.5*10" 9 F 
RC = 72x10“® s 

Spectrum curve of the noise: 

|.A(v) | = VVCL + ( v / v gr) 2 v gr = 1 / 2ltRC “ 2210 C P S 
Autocorrelation function: 

4(t) = $(0)e" ^/ RC 

Measurements : 

(a), (6) The filter response curves determined with pure tones and 
the spectral analysis of the filtered noise, give a limiting frequency 
of 2220 cycles per second. At higher frequencies up to 25 kilocycles 
per second, there occurs the normal drop with l/v, while still higher 
frequencies are completely cut off by the delay circuit. 

(y) In figure 6 the measured autocorrelation curve Is shown for 
linear and logarithmic scale. It is essentially an exponential function 
with a time constant RC = 80 microseconds. This value corresponds to a 
limiting frequency of 2000 cycles per second; the deviation from the 
theoretical value thus amounts to about 10 percent. The reason for the 
flattening of the autocorrelation function at t =* 0 is that the fre- 
quencies above 25 kilocycles per second are entirely absent (see (a)). 
From the circle of curvature at the zero crossing there is obtained, for 
the average number of zero crossings, according to equation (3), a value 

of 8.5*10'-’ sec -1 . This number is very uncertain, since the measurements 
could not be carried out for a sufficiently small value of T. 

(5) The counter measurement gave a value of 6. 2x10 s sec”^- for the 
average number of zero crossings. 

Measurement results on additional RC filters^ whose time constants 
embraced a range of 1:50 are presented in table I. 


-The author wishes to thank P. D&mmig for his help in carrying out 
the RC filter measurements. 
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Results : 

A noise that has passed through an RC filter can he conveniently 
analyzed with the autocorrelator. The time constant RC of the filter 
is, with the given apparatus, determined to about 10 percent error. It 
is important, however, that no phase shifts occur between the filter and 
the multiplier; otherwise, the autocorrelation curve is falsified. A 
cutting off of the high frequencies between the filter and the multiplier 
evidences itself in the flattening of the autocorrelation curve at the 
point t = 0, rather than possessing a finite slope at this point. 

2. Noise Through LC Filter (fig. 7(a)) with Various Amounts of Damping 

L = 4.9 henries 

C = 630 micromicrofarads 

Vq = 2865 cycles per second 

The loss factor of the coil was very small (t^ 5=1 R r /2rtVQL = 0.06) . 

The loss factor of the condenser was varied by parallel connecting of 
resistances Rp (tiqc = l/Rp2rtVQC = 0.22; 0.44; and 1.10). The frequency 

curve of the noise is 



1 

£1 - (2kv) 2 I£ + Rr/^pJ 2 + (2«v) 2 LC(t 1ol + q^) 2 


and possesses, as is obtained through differentiation, a maximum at v r , 
the resonance frequency of the filter: 


v r " v 0 V 1 " \ (^0L + ^Oc) 
The autocorrelation function is 

#(t) = # (0)e _a M cos 0 |t| 



where 


4191 



4191 


MCA TM 1404 


13 




3 = 2jtVj is tlie angular frequency of the free damped oscillations of the 
filter. The frequency 


= v 


0 



agrees with the resonance frequency v r only in the case of very small 
damping. 


For the average number of zero crossings per second of the filtered 
noise there is obtained, according to equation (3) 



2v 


r 


Measurements : 


Rp * 400 k2 

= 0.22 

The over-all loss factor of the filter is 


T!q = 0.22 + 0.06 = 0.28 

(a) From the filter response curve measured with pure tones (fig. 
7(a)) there is obtained a resonance frequency v r = 2820 cycles per sec- 
ond (theoretical value 2820 cps) , and a half -value width Av = 870 cycles 
per second. The measured loss factor is thus t\q = 0.30. 

(8) The spectrum (fig. 7(b)) of the filtered noise shows the same 
picture) however, the resonance frequency and the half -value width cannot 
be read with the same accuracy as in the filter measurement with pure 
tones . 

(y) From the measured autocorrelation function (fig. 7(c)) there 'is 
obtained a frequency of 2740 cycles per second (theoretical value 2850 
cps), and a damping q = ln#(0)/(3t In $ ^) =0.28 (theoretical value 

likewise = 0.28). The damping of the transmission line is taken into 
account. The determination of the zero crossings of the time function 
from the radius of curvature of the autocorrelation function at T = 0 
is very inaccurate. There is obtained approximately n = 4000 sec-1. 

(5) The determination of the number of zero crossings per second with 
the decimal counter gave n = 5050^60, or expressed as a frequency, 
v = 2525±30 cycles per second (theoretical value, 2820 cps) . 
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Measurements for another loss factor of the LC filter are presented 
in figure 8. It Is clearly seen in what manner the spectrum curve and 
the autocorrelation curve depend on each other. The wider the frequency 
curve , the more damped is the autocorrelation function. 

Table II lists all the experimental results that were obtained with 
LC filters of different damping and serves for quantitatively comparing 
the individual measuring methods. This table also shows that the natural 
frequency and damping of LC filters may be determined with the given 
autocorrelator within about 15-percent error. Only for the most strongly 
damped filter, t}q « 1.2, do larger deviations occur. This was found to 

be true for all filters whose frequency range reaches to very low fre- 
quencies and is evidently due to the phase shifts within the amplifier 
at very low frequencies. 

3. Noise Through Rectangular Filter 

Frequency response curve of the filter: 

y(.v) « A for v Q - Av/2<v < v Q + Av/2 


and 


Y(v) « 0 elsewhere 

If N is the constant spectral density of the noise, then 

G(v) * n|y( 0)| 2 « NA 2 for v Q - Av/2< v<v Q + Av/2 

G(v) a 0 elsewhere 

The autocorrelation function, according to equation (20), is 


R(t) a cos 2rtV n T sin rtAvT 

The following three filters were investigated: 

(a) Octave filter vq “ 4800 cps, Av a 3200 cps 

(b) One-third octave filter Vq a 2850 cps, Av a 700 cps 

(c) Low pass filter Vgj* «« 2 vq « Av a 16 heps 



4191 


NACA TM 1404 


15 


(a) Noise through octave filter 

(a) Figure 9(a) shows the frequency response curve of the filter 
measured with sine tones. To determine the half -value frequencies 6305 
and 3100 cycles per second, the decimal counter was used as a frequency 
meter. There was obtained vq ■ 4702 cycles per second and Av * 3205 
cycles per second. 

(p) The spectral analysis of the noise is shown in figure 9(b). 

The frequency response curve of the filter is also well reproduced in 
this manner. 


(r) The continuous curve in figure 9(c) is computed from formula 
(20 ) for Vq ** 4800 cycles per second and Av r* 3200 cycles per second. 

The measured points (the crosses indicate -t and the circles indicate 
+t) agree quite well with the computed curve. As a measure for the damping 
of the autocorrelation curve, the ratio of the first maximum to the first 
minimum may be used. For an octave filter, there is theoretically 
obtained $(o)/4» mia =* 1.2^ the measured value was 4>(o) n »« 1.3. From 

the radius of curvature at the zero crossing, there is computed a mean 
number of zero crossings of 10.8x10^ see"^-, corresponding to a frequency 
of 5400 cycles per second. The determination of the radius of curvature 
is not very accurate, however. 


(5) The counting of the zero crossings with the decimal counter gave 
n » (I0.0±0.2)xl0 3 , that is, v a* 5.Ca£*l kilocycles per second. The 
comparison with the theoretically predicted value 


n 



9.8x10^ sec“l. 


v b 


v 0 + Av/2 
v Q - Av/2 


shows good agreement. 

(b) Noise through one-third octave filter 

(a),(p) The response and spectrum for the one -third-octave filter 
show (figs. 10(a) and (b)) that although the cutoff of the filter was 
sharp, its top was rounded. With the decimal counter the f r e queue ie s 
of the half -value points were determined as 3230 and 2500 cycles 
per second. From thiB there is obtained a mean frequency vq » 2865 

cycles per second, and a band width Av *« 730 cycles per second (theoret- 
ical values 2850 and 700 cps). 

(y ) The continuous curve in figure 10 (c) represents the measured 
autocorrelation curve. It drops somewhat more rapidly than the ideal 
computed autocorrelation function for a rectangular filter in figure 
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10(a), the maxima and minima of which are plotted as crosses in figure 
10(c) (the damping of the transmission line was taken into account). 

For conparison, the extreme values of the autocorrelation curves for a 
resonance circuit of equal mean frequency and half-value width are in- 
dicated by triangles. The curve drops considerably more rapidly than 
the measured autocorrelation function of the third filter. 

(6) With the decimal counter, a mean number of zero crossings of 
n * 5690,±30 sec - ^ was measured; this corresponds to a frequency of 
2845115 cycles per second. If the values measured under (a) of 
Vq « 2865 cycles per second and A v * 730 cycles per second are used as 

a basis, then the expected value is n ■ 5.80x10^ sec-1. Therefore, the 
agreement is good. 

(c) Noise over low pass 

. . A2 

Autocorrelation function: <£(t) “ sin 2ltv gr T 

As a low pass filter there was used the storage magnetic tape, whose 
response curve at 16 kilocycles per second drops with considerable 
steepness (see fig. 2). 

(x) The continuous curve in figure 11 is the computed autocorrela- 
tion function for a low pass filter with a limiting frequency of 16 
kilocycles per second, with which the measured points, plotted as circles, 
show quite good agreement. 

(6) For the number of zero crossings the measurement with the counter 
gave n « 18x103 sec-1. The expected theoretical value would be 
n » 1.155 Vgp « 18.5xl0 3 sec"- 1 -, a value that agrees very well with the 
measured value. 

4. Cross Correlation 

Whereas the autocorrelation of a filtered noise, according to equa- 
tion (6), gives the simplified weighting function of the filter (without 
considering the phases), the weighting function itself is obtained If a 
cross correlation of the output voltage of the filter with its input 
voltage is taken. The cross correlation makes higher demands on the 
apparatus since it requires two accurately equal amplifiers whose phase 
shift also remains small for low frequencies. 

Cross-correlation curves for an RC filter (RC « 0.9x10-3 sec) and 
for an LC filter (v r « 1500 cps) are shown in figures 12 and 13. In 

contrast to the autocorrelation curves, which always start with their 
maximum value, the cross -correlation curves start with zero and, in 
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addition to the decay process, also show the starting oscillations of 
the filter. 


(b) position modulated impulse sequence 

As an example of a process very rich in overtones, impulse sequences 
were investigated. The impulses were taken from the decimal counter and 
could he released either periodically (through a sine voltage), or in an 
irregular sequence (through wabhling or noise voltage). Figure 14 shows 
the form of the impulses at the inlet of the multiplier; they could not 
he made arbitrarily narrow because the transmission line limited the 
frequency range at 25 kilocycles per second. 

The effects which the sequence of the impulses has on the autocor- 
relation function and on the Fourier spectrum are shown by the measure- 
ments presented in figures 15 and 16, respectively. 

In (a), the impulses are released by a sine voltage of 1425 cycles 
per second (i.e., purely periodic) . The sound analysis (fig. 16(a)), 
gives a spectrum of discrete partial tones, and the overtones of the 
inpulse give a sequence of frequencies v^. The autocorrelation function 

(fig. 15(a)), consists of Individual Jags which repeat periodically after 
the displacement time x « l/v^. The decrease In size of the Jags with 
increasing x is a measure of the accuracy with which the sequence of 
inpulse frequencies is controlled. In regular sequences of inpulses all 
the peaks should be of equal height; the decrease of the maximum In 
figure 15(a) is due to the damping of the transmission line. A comparison 
with figure 14 shows that the autocorrelation function has very great 
similarity to the time function. 

In the releasing of the impulses through a wabble tone (1400130 cps), 
the autocorrelation function changes very little; the Jags decrease some- 
what more rapidly with increasing t compared with those shown in figure 
15(a). The spectral analysis (fig. 16(b)) also contains the uneven 
partial tones which are drawn Into narrow frequency bands. 

If the impulses are released through a statistical process (using a 
noise-band width of Av/v « 0.6 so adjusted that the mean frequency of 
the impulse sequence was about 1380 cps), the Jags of higher order In the 
autocorrelation function (fig. 15(b)), disappear. From about x « 400 
microseconds on, §(t) e 0. The spectral analysis figure 16(c) shows 
(in place of the discrete partial tones), a wide continuous frequency 
band starting at about the mean frequency of the impulse sequence and 
decreasing very slowly after higher frequencies. 
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In summary,, It may lie said,, in regard to the autocorrelation function 
of impulse sequences , that the decrease of the autocorrelation function 
at x ta o is determined hy the content of the time function in high 
frequencies independently of the circumstance , whether it is a question 
of discrete overtones or wide frequency hands. In order to determine 
whether the time function is for a periodic or an aperiodic process, the 
autocorrelation function must be investigated up to relatively large 
values of x. Only if the autocorrelation function remains continually 
zero from a definite value of x onward does the time function correspond 
to an aperiodic process. 


(C) ADDITIVE SUPERPOSITION OF A PERIODIC AND AN 
APERIODIC TIME PROCESS 

Since, in the case of language sounds, for example, voiced consonants, 
we are frequently dealing with a superposition of two or more time 
processes, it will be shown in a further model test, how the Individual 
components can be separated with the aid of the autocorrelation analysis. 

The simplest and clearest case is that of the additive superposition 
of a pure sine tone f s (t) with a wide band noise f r (t). The autocorre- 
lation function In this case is (ref. 2) 

4>(t) - lim /* +T [f s (t) + f r (t)] [f s (t ±x) + f r (t ± t:)] dt 

T-*«o./-T 

- $ ssW + S’rrC'O + *^(*0 + * rs (*0 

The two terms with mixed indices drop away because^the cross correlation 
of two incoherent processes is zero. The autocorrelation of the sum of 
two incoherent processes is, -thus, equal to the sum of their autocorre- 
lation functions. For the particular case t a 0, we have 

S(0) - $ BS (0) + # rr (0) « fg(t) + f^(t) 

that Is, for x a 0, it is possible to obtain the square mean of the time 
functions from the autocorrelation diagram. 

As an example, figures 17(a) and (b) show the frequency spectrum 
and the autocorrelation function of a mixture of noise, and a sine tone 
of about 4 kilocycles per second. Both noises were so adjusted that they 
each produced the same deflection on a. tube voltmeter with linear recti- 
fication. According to Beranek (ref. 8), the mean square values are then 
In the ratio 

f2(t)/f|(t) -1.3 
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From the autocorrelation diagram, there can immediately he read the energy 
ratio 


«r(0) 


fg(t) 

«fct) 


74 - 31 
31 


to 


1.4 


In order to compute the sine and noise components of the frequency 
spectrum, the filter width of the sound analyzer and the hand width of 
the noise must he knownj ■ in the case considered they are 15 cycles per 
second and 16 kilocycles per second, respectively . From figure 17 (a) 
there is then obtained 

f2(t)/f2(t) -1.5 

In a second measurement, in which the energy of the sine tone was only 
one-fourth of the energy of the noise, the following values were obtained 
for the ratio of the two energies: theoretical value, 5.0; autocorrela- 

tion analysis, 4.8; and sound analysis, also 4.8. The accuracy of the 
measurement is about equal for both processes. 

In the case of voiceless consonants we have to deal not with the 
superposition of a pure sine tone and a very broad banded noise, but 
frequently with the superposition of two relatively narrow but continuous 
frequency bands. As an example, the results are given for the voiceless 
"s”, whose spectral analysis is shown in figure 18(a), and whose auto- 
correlation function is shown in figure 18(b). If it is desired to 
determine, from the frequency spectrum, what part of the total energy 
corresponds to the small peak at 7 kilocycles per second and what part 
of the total energy corresponds to the broad frequency range at 9 kilo- 
cycles per second, the areas under these peaks must be measured with an 
integrating planimeter. This measurement, which was carried out with 
an enlargement of figure 18(a), gave, for the ratio of the energies, 

n 9 kcps/ N 7 kcps * 6 * 2 * 

The separation of the two autocorrelation functions can easily be 
effected if the maxima and minima of the measured autocorrelation func- 
tion is plotted to logarithmic scale (fig. 18(c)). In spite of the 
large scatter of the measured values, two straight lines which character- 
ize the exponential decrease of the two oscillations can easily be drawn. 
The steep line corresponds to the broad frequency band, and the line with 
smaller slope corresponds to the narrow band. From the points of inter- 
section of the straight lines with the axis of ordinates there is read, 
for the ratio of the energy of the two peaks at 9 kilocycles per second 
to that of the narrow peak at 7 kilocycles per second: 


$9 kcps(°) 90 - 14.5 
*7 tops W 14 - 5 


5.3 
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The agreement of the two values is satisfactory in view of the fact that 
the planimeter measurement is not very accurate. 

(d) VOICED AMD VOICELESS SIBILANT SOUNDS 

Voiceless sibilant sounds consist of more or less broad, continuous 
frequency bands whose shapes depend considerably on the position of the 
mouth in uttering them. If, for example, the mouth is in the position 
for pronouncing u while the sibilant is uttered, the latter will gen- 
erally contain only a single, relatively narrow frequency range. 

Voiced sibilants have essentially the same frequency spectrum as 
the corresponding voiceless sounds except that there is, in addition, 
the much more energy rich, purely periodic voice tone. 

According to Meyer-Eppler (ref. 9), there is defined, as a measure 
of the degree of voicing S of a sound, the ratio 

s- ^p(o)/*(o) 

where 4>p (0) is the autocorrelation function of the periodic component 
and #(o) that of the total sound, both taken for T » 0. 

The sibilant investigated was recorded on an endless recording loop 
of about 1 meter length in such manner that It filled out the tape except 
for the unavoidable gap of 5 percent of the tape length. To determine 
the number of zero crossings with the counter, the gap of the recording 
was taken into account by counting over a longer time, measured with a 
stop watch (about 10 cycles of the loop) and then multiplying the In- 
dicated number of zero crossings by the factor 1.05. 

In the following paragraph one example of the large number of meas- 
urements for each of the Investigated sibilants (table III) will be dis- 
cussed. Further, an attempt will be made to distinguish the character- 
istic properties of the individual sounds . 


Voiceless Sibilants 


1. Voiceless "sh" 

(a) The spectral analysis (fig. 19(a)), gives a continuous spectrum 
without discrete lines. It possesses two preferred frequency ranges, a 
high, relatively narrow peak at 5.3 kilocycles per second (half -value 
width about 300 cps), and a smaller peak at 1.2 kilocycles per second. 
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(p) The autocorrelation, curve (fig. 19(h)), has the form of a damped 
cosine oscillation. From the distance of the first six maxima there is 
obtained a period t = 185 microseconds, corresponding to a frequency of 
5.4 kilocycles per second (sound analysis 5.3 kcps) . On the fundamental 
period is superposed a small secondary period whose value is obtained 
from figure 19(b) as about 740 microseconds (1.3 kcps). It correponds 
to the small peak at 1 kilocycle per second in figure 19(a). To deter- 
mine the half -value width of the fundamental period, the extreme values 
of the autocorrelation function were logarithmically plotted in figure 
19(c). In spite of the considerable scatter of the measuring points 
(caused by the superposed secondary period), it can be seen that the 
points lie approximately on a straight line, and hence, that the auto- 
correlation function decreases exponentially and not according to the 
slit function sin a/c/ arc . From this, it can be concluded that the reso- 
nator formed by the mouth cavity in speaking the "sh" sound has the 
properties of a simple mass-spring system, and is not a filter with rec- 
tangularly limited frequency range. The half -width value of the resonator 
is computed from the drop of the autocorrelation function, with account 
taken of the losses of the transmission line, and is found to be 360 
cycles per second (spectral analysis 300 cps) . The point $ (t = 0) lies 
uniquely above the straight line which is passed through the remaining 
measuring points. This fact indicates that, in addition to the preferred 
frequency ranges of 1.2 and 5.4 kilocycles per second, there exists a 
further wide noise background whose autocorrelation function drops off 
so rapidly that It has no effect on the remaining measuring points. In 
this noise background, which can also be seen on the frequency diagram 
(fig. 19(a)), there is contained, as can immediately be read from the 
autocorrelation diagram, about 35 percent of the total energy of the 
sibilant sound. 

(y) In the measurement with the decimal counter, there was obtained 
a mean number of zero crossings of 9.8*103 sec - ^-, corresponding to a 
central frequency of 4.9 kilocycles per second. This value lies about 
8 percent lower than was expected according to measurements (a) and (p) . 

Measurements of other "sh" sounds, some of which were spoken by one 
person for various positions of the mouth and some by seven different 
persons, showed the following common properties that appear to be char- 
acteristic for the "sh" sound. The frequency spectrum often possesses 
two preferred frequency ranges each having a width of 500 cycles per sec- 
ond. The upper region generally lies between 5 and 6 kilocycles per sec- 
ond, the lower region between 1 and 3 kilocycles per second. Since, in 
uttering the "sh" sound, two cavities axe formed by the position of the 
tongue, we are possibly dealing with the resonances of these two cavities. 

2. Voiceless "s" 

The Investigations on a voiceless "s" sound (figs. 18(a), (b), and 
(c)), have already been discussed in the preceding section. Hence, only 
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the values obtained, for the central frequencies and damping of the two 
preferred frequency ranges, according to'the different methods, are 
collected here: 

(a) Spectral analysis: 

v 1 = 7.0 kcps 
v g = 8.8 kcps 

A v ]_/ v i = 200 cps 
AVg/Vg * 1500 cps 

(p) Autocorrelation: 

v = 7.4 kcps 
Vg s= 8.7 kcps 
= 160 cps 
Avg/vg ss 2000 cps 

Vg was computed from the distance of the first maximum of the autocorre- 
lation diagram, and was computed from the distance of the last of 

the 19 measured maxima (in fig. 18(h), only the first eight periods are 
drawn in) . 

(r) Counter measurement: The counter, naturally, does not distinguish 

between the two frequencies. There were counted 18*10*^ sec - ^ zero cross- 
ings per second corresponding to a central frequency of 9 kilocycles per 
second. Measurements a, 0, and y are, therefore, particularly as 
regards the frequencies^, in good agreement with each other. 

Almost all "s” -spectra (spoken by seven different speakers) have 
tlie common characteristic that frequencies below 5 and above 10 to 12 
kilocycles per second, are completely absent. Between 5 and 10 kilocycles 
per second there is a sharply limited, very strong cleft frequency band 
which evidently consists of several relatively narrow frequency bands. - 
In agreement with this, the autocorrelation diagram of "s" sounds gen- 
erally has a relatively undamped character. 


5 The differences mentioned in section 4 (a) of the characteristic 
frequency measured with the sound analysis and with the autocorrelation 
analysis, is not taken into account here. 
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3. Voiceless "f " 

(a) The sound analysis (fig. 20(a)) gives a very "broad continuous 
spectrum which practically fills the entire investigated frequency range 
(up to 16 kcps) . Individual, very narrow peaks project somewhat from 
the noise hand. 

(p) The autocorrelation function (fig. 20(b) shows a section from 
0-l*lCr 3 sec), is unusually strongly damped. At the second maximum 
(r * 86 p sec) , the autocorrelation function amounts to only 10 percent 
of the value of #(o) . For t > 100 microseconds, the autocorrelation 
function is a very weakly damped, somewhat irregular cosine function 
whose period correponds to a frequency of about 11 kilocycles per second. 
The waviness is not completely damped even at t = 2000 microseconds. 

The jag of the spectrum to which this very undamped oscillation corre- 
sponds could not be explained. The logarithmic representation of the 
measured autocorrelation function showed that, in this periodic component 
only, about 8 percent of the total energy is contained. 

(y) Counter measurement: The mean number of the zero crossings 

amounted to 19.6*10 3 per second, that is, the frequency center of gravity 
lay at about 10 kilocycles per second. 

Characteristic of all investigated "f " sounds (seven different 
speakers) was the wide frequency band and the rapid drop of the autocor- 
relation function. Through suitable mouth position it was possible to 
stress Individual frequency ranges, but the proportion of the total energy 
in the wide band was seldom below 75 percent. 


Voiced Sibilant Sounds 

4. The "zh" Sound (Voiced "sh n as in French "journal") 

(a) The spectrum of the voiced "zh" sound (fig. 21(a)) consists of 
a continuous part which is equal to the spectrum of the corresponding 
voiceless "sh" sound and the discrete lines of voice tone and its over- 
tones. (The amplitude of the vocal tone is much greater than that of 
the noise component so that the spectrum, fig. 21(a), had to be plotted 
to logarithmic scale.) The voice tone lies accurately at 390 cycles per 
second (determined with the decimal Counter) , the preferred frequency 
ranges of the noise component at 2.5 and 6 kilocycles per second. 

(fi) Figure 21(b), represents the measured autocorrelation function, 
which is additively composed of three curves. In a simple manner, there 
can be split off a cosine curve of the period T = 26 microseconds 
(v = 382 cps), which correponds to the voice tone (<i> s (tt) in fig. 21(b)). 
The rest of the autocorrelation curve (fig. 21(b)) splits into a periodic 
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part with x = 162 microseconds (6.2 kcps) and a noise part. Okie total 
energy is distributed, as is read from figure 2l(b), as follows: voice 

tone, 73 percent; frequency band at 6 kilocycles per second, 5 percent; 
and noise part, 22 percent. The degree of voicing "is 

S = a/65/89 =» 0.85 

(y) The determination of the zero crossings with the decimal counter, 
as in the case of all voiced sounds, is impossible here because the rel- 
atively small noise component, along with the strong voice tone, cannot 
be reliably determined. 

5. Voiced "s" 

(a) The sound analysis (fig. 22(a)) also shows, in addition to the 
discrete lines of the voice tone and its overtones, a continuous spectrum 
very similar to that of a voiceless "s". The preferred frequency range 
lies at 9 kilocycles per second, and has a half -value width of about 1.6 
kilocycles per second. The frequency of the voice tone, determined with 
the decimal counter, was 350 cycles per second. 

(g) The autocorrelation curve (fig. 22(b)) can very easily be Bplit 
into two curves. The curve with. the longer period, 3100 microseconds 
(322 cps), corresponds to the voice tone, whereas the other curve repre- 
sents a damped cosine function of the period x m 115 microseconds (8.7 
kcps). The frequency band width computed from this oscillation is 1.5 
kilocycles per second. The data of the two oscillations, computed from 
the autocorrelation function, thus agree very well with the results of 
the sound analysis. The degree of voicing of the sound can be read most 
simply from the autocorrelation curve 


S = V5 7/89 = 0.80 

6. Voiced "v" 

(a) The spectrum contains a very strong discrete line at 335 cycles 
per second (fig. 23(a)). The noise component is a very weak noise band 
that extends over the entire frequency range of the apparatus. 

(p) The measured autocorrelation curve (fig. 23(b)) shows the same 
result. The voice tone has a period x = 3200 microseconds (310 cps), 
and contains 96 percent of the total energy. It is superposed by a very 
strongly damped cosine vibration whose period is about x » 70 micro- 
seconds (14 kcps) . 


4191 



CM-4 4191 


NACA TM 1404 


25 


Degree of voicing: 


S = zy/90/94 = 0.98 


Short Sibilant Sounds 

In addition to sibilant sounds, which were held for sometime, briefly 
uttered voiceless sibilant sounds were also investigated. The autocor- 
relation curves showed great similarity with those of the corresponding 
longer uttered sounds. Characteristic differences could not be 
established. 

The author wishes to thank Professor E. Meyer for his suggestions 
and valuable advice. 
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a The percentages give the deviations of the measured values from the 


computed value. 

^The curve is so strongly damped that neither the resonance frequency nor 
the half -value width could he obtained from it. The measured values, however, 
agree well with the computed curve (fig. 8(a)). 


TABUS III 


LIST 0? nmBTIGATED BIBHAHTS (and BBICATIVTS) 
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Figure 2. - Frequency curve of storage magnetic tape. 
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Figure 3. - Characteristic curve of over-all _ 
autocorrelation apparatus. 
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(a) Frequency response curve of filter, 

o o o measured with, pure tones, 

simplified curve used for computing 
autocorrelation function. 


Ob) Spectrum of filtered noise. 




(c) Autocorrelation function; computed from figure 9(a); 

o o o measured for + t; xxx measured for - T . 

Figure 9. - Noise through, rectangular filter of octave vidth. 
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(h) Release of impulse through noise voltage. 


Figure 15. - Autocorrelation functions of a sequence of impulses. 
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(c) Release of impulses through noise voltage. 
Figure 16* - Sound analysis of sequences of impulses. 




















(b) Above: measured autocorrelation curve . 

Below: autocorrelation curve of noise component 


Figure 22. - Voiced "s" ("z”). 






